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NOVEL MUTATIONS IN HUMAN MLHl AND HUMAN MSH2 GENES USEFUL 

IN DIAGNOSING COLORECTAL CANCER 



INTRODUCTION 

This application claims the benefit of U.S. provisional 
5 application Serial No. 60/105,180, filed October 22, 1998. 

BACKGROUND OF THE INVENTION 

Colorectal cancer (CRC) is one of the most common fatal 
cancers in developed countries, and the worldwide incidence 
is increasing. The United States and the United Kingdom are 

10 high incidence countries, with an estimated 133,500 new cases 
and 55,300 deaths (Parker et al . CA Cancer J. Clin. 1996 46:5- 
27) in the United States and 30,941 cases and approximately 
17,000 deaths in the United Kingdom (HMSO UK Cancer Registry 
Data) . The population lifetime risk is 1 in 25 in the United 

15 States and Northern Europe and thus represents a significant 
public health issue (Sharp et al . Cancer Registration 
Statistics Scotland 1981-1990, Information and Statistics 
Division, The National Health Service in Scotland, Edinburgh 
(1993)) . Identification of people who are predisposed to the 

20 disease would allow targeting of effective preventative 
measures with the aim of reducing the considerable cancer 
related mortality (Burke et al . J. Am. Med, Ass^n. 1997 
227 :915-919) . 

One group of people with a very high colorectal cancer 
25 risk are those who carry germline mutations in genes that 
participate in DNA mismatch repair. hMSH2 (Fishel et al . Cell 
1993 75:1027-1038; Leach et al . Cell 1993 75:1215-1225; U.S. 
Patent 5,591,826) and hMLHl (Bronner et al . Nature 1994 
368:258-261; Papadopoulos et al . Science 1994 2 63:1625-1629; 
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PCT Publication No, WO 95/20678, published on August 3, 1995) 
are the two genes most commonly involved in heredity 
predisposition to CRC, but mutations in hPMSl and hPMS2 also 
occur in a minority of cases (Nicolaides et al . Nature 1994 
5 371:75-80) . Such mutations are usually associated with marked 
familial aggregation of colorectal, uterine and other cancers 
constituting the clinically defined autosomal dominant 
syndrome of hereditary non-polyposis colorectal cancer (HNPCC) 
(Lynch et al . Gastroenterology 1993 104:1535-1549; Liu et al . 
10 Nature Med. 1996 2:169-174; Wijnen et al . Am. J. Hum. Genet. 
1995 56:1060-1066; Mary et al . Hum. Mol . Genet. 1994 3:2067- 
2069; Nystrom-Lahti et al . Nature Med. 1995 1:1203-1206). 
i| However, an appreciable proportion of patients who have early 

^ onset colorectal cancer but who do not fulfill pragmatic 

m 15 criteria for HNPCC (Vasen et al . Dis. Colon Rectum 1991 

34:424-425) also carry mismatch repair gene mutations (Liu et 
M al. Nature Med. 1995 2:169-174; Dunlop et al . Br, Med. J. 1997 

314:1779-1780). Thus, restricting genetic testing to 
a individuals from families fulfilling HNPCC criteria is likely 

20 to exclude a significant fraction of gene carriers in the 
general population. However, screening unselected patients 
with sporadic cancer represents an enormous workload and may 
provide a very low yield of mutation carriers (Liu et al . Nat. 
Med. 1995 1:348-352; Tomlinson et al. J, Med. Genet. 1997 
25 34 : 39-42) . 

It is clear that issues concerning indications for 
genetic testing and interpretation of results are critical in 
hereditary cancer syndromes (Giardiello et al. N. Engl. J. 
Med. 1997 336 : 823-827) . 
3 0 Using a population-based approach, factors indicative 

of the likelihood of identifying patients with mismatch repair 
gene mutations were investigated. Improved approaches to 
mutation detection and the prevalence of detectable mismatch 
repair gene alterations in various screened groups who were 
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not selected on the basis of family history were also 
determined . 



SUMMARY OF THE INVENTION 

An object of the present invention is to provide novel, 
5 variant hMLHl sequences . 

Another object of the present invention is to provide 
novel, variant hMSH2 sequences. 

Another object of the present invention is to provide 
a method of diagnosing hereditary non-polyposis colorectal 
10 cancer in a patient or determining a patient's susceptibility 
to developing hereditary non-polyposis colorectal cancer via 
detection of novel variant hMLHl or hMSH2 sequences or the 
exonic or intronic sequences of the hMLHl and hMSH2 genes. 

Another object of this invention is to provide methods 
15 and compositions for identifying new variants of hMLHl and 
hMSH2 genes . 

Yet another object of the present invention is to 
provide experimental models of hereditary non-polyposis 
colorectal cancer . 



20 DETAILED DESCRIPTION OF THE INVENTION 

To better elucidate the structure of human MLHl and 
human MSH2 genes and to determine possible sites of 
alternative splicing, the genes were cloned and sequenced and 
PGR was used to determine alternate splice products (variants) 
25 and exon/intron boundaries. Elucidation of intron/exon 
boundary sequences revealed that hMLHl is encoded by 19 coding 
exons. The hMLHl gene sequence was determined by PGR. 

The intron/exon structure of the hMLHl is shown below. 
Positions of introns that interrupt the hMLHl cDNA are shown. 
3 0 Exonic sequence is presented in upper case and intronic 
sequence in lower case letters. Exons are numbered from the 
5' end of the cDNA sequence. 
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hMLHl Exon 1 

agacactaaaataattqcrc (SEQ ID N0:1) 

tgaaggcacttccgttgagcatctagacgtttccttggctcttctggcgccaaa (SEQ 
ID NO: 2) 

5 ATGTCGTTCGTGGCAGGGGTTATTCGGCGGCTGGACGAGACAGTGGTGAACCGCATCGC 
GGCGGGGGAAGTTATCCAGCGGCCAGCTAATGCTATCAAAGAGATGATTGAGAACTG ( SEQ 

ID NO: 3) 

gtacggagggagtcgagccgg (SEQ ID NO: 4) 
actcactt.aaaaactacqa (SEQ ID NO: 5) 
10 cttaacgg (SEQ ID NO: 6) 

hMLHl Exon 2 

aatatatacat-.taaaataqttq (SEQ ID N0:7) 
cagactgataaattattttctgtttgatttgccag (SEQ ID NO: 8) 
TTTAGATGCAAAATCCACAAGTATTCAAGTGATTGTTAAAGAGGGAGGCCTGAAGTTGA 

15 TTCAGATCCAAGACAATGGCACCGGGATCAGG (SEQ ID NO: 9) 

gtaagtaaaacctcaaagtagcaggatgtttgtgcgcttcatggaa (SEQ ID NO: 10) 
gaat c acfgacc 1 1 1 c t c t g (SEQ ID NO: 11) 

hMLHl Exon 3 

aaaqattfcgqaaaatqagtaac (SEQ ID NO: 12) 
20 atgattatttactcatctttttggtatctaacag (SEQ ID NO: 13) 

AAAGAAGATCTGGATATTGTATGTGAAAGGTTCACTACTAGTAAACTGCAGTC 
CTTTGAGGATTTAGCCAGTATTTCTACCTATGGCTTTCGAGGTGAG (SEQ ID NO : 14 ) 
gtaagctaaagattcaagaaatgtgtaaaatat (SEQ ID NO: 15) 
cctcctqtgatoacattqt (SEQ ID NO: 16) 
25 c 

hMLHl Exon 4 

aacctttccctttqgtgaQQ (SEQ ID NO: 17) 

tgacagtgggtgacccagcagtgagtttttctttcagtctattttcttttcttccttag 
(SEQ ID NO: 18) 

3 0 GCTTTGGCCAGCATAAGCCATGTGGCTCATGTTACTATTACAACGAAAACAGCT 
GATGGAAAGTGTGCATACAG (SEQ ID NO: 19) 
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gtatagtgctgacttcttttactcatatatattcattctgaaatgtattttgg 
NO:20) 

acctaaatctcaaaataatc (SEQ ID NO: 21) 

hMLHl Exon 5 

5 ttgatat (SEQ ID NO: 22) 

aattttctcttttccccttagq (SEQ ID NO: 23) 
attagtatctatctctctactggatattaatttgttatattttctcattag (SEQ ID 

NO: 24) 

AGCAAGTTACTCAGATGGAAAACTGAAAGCCCCTCCTA7VACCATGTGCTGGCAATCAAG 
10 GGACCCAGATCACG (SEQ ID NO : 25) 

gtaagaatggtacatgggaca (SEQ ID NO: 26) 
ataaattatt.aaaactttqtttq (SEQ ID N0:27) 

hMLHl Exon 6 

ggattttat-t-ttcaaatacttctatQ (SEQ ID NO: 28) 
15 aatttacaagaaaaatcaatcttctgttcag (SEQ ID NO: 29) 

GTGGAGGACCTTTTTTACAACATAGCCACGAGGAGAAAAGCTTTAAAAAATCCAAGT 

GAAGAATATGGGAAAATTTTGGAAGTTGTTGGCAG (SEQ ID NO : 3 0 ) 
gtacagtccaaaatctgggagtgggtctctgagatttgtcatcaaagtaatgtgttctagt 

(SEQ ID NO: 31) 
20 gctcatacattaaacaattactqaac (SEQ ID NO:32) 

hMLHl Exon 7 

ctagtgtatatttttaac (SEQ ID NO: 33) 

aactcttttcttactcttttgtttttcttttccag (SEQ ID NO: 34) 
GTATTCAGTACACAATGCAGGCATTAGTTTCTCAGTTAAAAAA (SEQ ID NO: 35) 
2 5 gtaagttcttggtttatgggggatggttttgttttatgaaaagaaaaaaggggattttt 

aatagtttgct (SEQ ID NO: 36) 
ggtggagataagattatQ (SEQ ID NO:37) 

hMLHl Exon 8 

Gtcagccatqaqacaataaatcc (SEQ ID NO: 38) 
30 ttgtgtcttctgctgtttgtttatcag (SEQ ID NO: 39) 



PATENT 

(SEQ ID 
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CAAGGAGAGACAGTAGCTGATGTTAGGACACTACCCAATGCCTCAACCGTGGACAATAT 
TCGCTCCATCTTTGGAAATGCTGTTAGTCG (SEQ ID NO: 40) 

gtatgtcgataacctatataaaaaaatcttttacatttattatcttggtttatcatt (SEQ 
ID NO: 41) 

5 r!r!at.cacal--i--^t:tt:aaaaacc (SEQ ID NO: 42) 
hMLHl Exon 9 

naaaaactt-raaaatctc (SEQ ID NO: 43) 
ttttctaatag (SEQ ID NO: 44) 

AGAACTGATAGAAATTGGATGTGAGGATAAAACCCTAGCCTTCAAAATGAATGGTTACA 
1 0 TATCCAATGCAAACTACTCAGTGAAGAAGTGCATCTTCTTACTCTTCATCAACC ( SEQ 

ID NO: 45) 

gtaagttaaaaagaaccacatgggaaat (SEQ ID NO: 46) 
ccactcacaaaaaacacccacaq (SEQ ID NO: 47) 

hMLHl Exon 10 

15 cataacttt.at-ataaatatacacc (SEQ ID NO: 48) 

tgtgacctcacccctcaggacagttttgaactggttgctttctttttattgtttag (SEQ 

ID NO: 49) 

ATCGTCTGGTAGAATCAACTTCCTTGAGAAAAGCCATAGAAACAGTGTATGCAGCCTATT 
TGCCCAAAAACACACACCCATTCCTGTACCTCAG (SEQ ID NO: 50) 
20 gtaatgtagcaccaaactcctcaaccaagactcacaaggaa (SEQ ID NO: 51) 
cagatattctatcaqgctctcctc (SEQ ID NO: 52) 

hMLHl Exon 11 

ggacttttl-rtncccctccc (SEQ ID NO: 53) 

actatctaaggtaattgttctctcttattttcctgacag (SEQ ID NO: 54) 
25 TTTAGAAATCAGTCCCCAGAATGTGGATGTTAATGTGCACCCCACAAAGCATGAAG 
TTCACTTCCTGCACGAGGAGAGCATCCTGGAGCGGGTGCAGCAGCACATCGAGAGCAAG 

CTCCTGGGCTCCAATTCCTCCAGGATGTACTTCACCCAG (SEQ ID NO: 55) 
gtcagggcgcttctcatccagctacttctctggggcctttgaaatgtgcccggccaga 

(SEQ ID NO: 56) 
30 nataaaaacrcaaatttt (SEQ ID NO:57) 
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hMLHl Exon 12 

aattatacctcatactacfc (SEQ ID NO:58) 

ttctttcttagtactgctccatttggggacctgtatatctatacttcttattctgagtct 
ctccactatatatatatatatatatatatttttttttttttttttttttaatacag (SEQ 

5 ID NO:59) 

ACTTTGCTACCAGGACTTGCTGGCCCCTCTGGGGAGATGGTTAAATCCACAACAAGTCT 
GACCTCGTCTTCTACTTCTGGAAGTAGTGATAAGGTCTATGCCCACCAGATGGTTCGTA 
CAGATTCCCGGGAACAGAAGCTTGATGCATTTCTGCAGCCTCTGAGCAAACCCC 
TGTCCAGTCAGCCCCAGGCCATTGTCACAGAGGATAAGACAGATATTTCTAGTGGCAGGG 
10 CTAGGCAGCAAGATGAGGAGATGCTTGAACTCCCAGCCCCTGCTGAAGTGGCTGCCAAAA 
ATCAGAGCTTGGAGGGGGATACAACAAAGGGGACTTCAGAAATGTCAGAGAAGAGAGGAC 

CTACTTCCAGCAACCCCAG (SEQ ID NO: 60) 
gtatggccttttgggaaaagtacagccta (SEQ ID NO: 61) 
nntcctttattctotaataaaac (SEQ ID NO: 62) 

15 hMLHl Exon 13 

tacaacccanaaaatttgqc (SEQ ID NO:63) 

taagtttaaaaacaagaataataatgatctgcacttccttttcttcattgcag (SEQ ID 
NO: 64) 

AAAGAGACATCGGGAAGATTCTGATGTGGAAATGGTGGAAGATGATTCCCGAAAGGAAA 
20 TGACTGCAGCTTGTACCCCCCGGAGAAGGATCATTAACCTCACTAGTGTTTTGAGTCTCCAG 
GAAGAAATTAATGAGCAGGGACATGAGG (SEQ ID NO: 65) 

gtacgtaaacgctgtggcctgcctgggatgcatagggcctcaactgccaa (SEQ ID NO: 
66) 

gattttagaaatqaaaaaaa (SEQ ID NO: 67) 

2 5 hMLHl Exon 14 

tgatatctctacfttctcfQ (SEQ ID NO: 68) 

tgcctggtgctttggtcaatgaagtggggttggtaggattctattacttacctgttttt 
tggttttattttttgttttgcag (SEQ ID NO: 69) 

TTCTCCGGGAGATGTTGCATAACCACTCCTTCGTGGGCTGTGTGAATCCTCAGTGGGCCTTG 

3 0 GCACAGCATCAAACCAAGTTATACCTTCTCAACACCACCAAGCTTAG (SEQ ID NO: 70) 

gtaaatcagctgagtgtgtgaacaa (SEQ ID NO: 71) 
acaaaact,actacaacaata (SEQ ID NO: 72) 
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iiMLHl Exon 15 

cccatttgtcccaactgg SEQ ID NO: 73 

ttgtatctcaagcatgaattcagcttttccttaaagtcacttcatttttattttcag (SEQ 

ID NO: 74) 

5 TGAAGAACTGTTCTACCAGATACTCATTTATGATTTTGCCAATTTTGGTGTTC 
TCAGGTTATCG (SEQ ID NO: 75) 

gtaagtttagatccttttcactt (SEQ ID NO: 76) 
ctgacatttcaactgaccg (SEQ ID NO: 77) 

hMLHl Exon 16 

10 catttagatgctccgttaaagc (SEQ ID NO: 78) 

ttgctccttcatgttcttgcttcttcctag (SEQ ID NO: 79) 

GAGCCAGCACCGCTCTTTGACCTTGCCATGCTTGCCTTAGATAGTCCAGAGAGTGGCTG 
GACAGAGGAAGATGGTCCCAAAGAAGGACTTGCTGAATACATTGTTGAGTTTCTGAAGA 
AGAAGGCTGAGATGCTTGCAGACTATTTCTCTTTGGAAATTGATGAG (SEQ ID NO: 80) 
15 gtgtgacagccattcttatacttctgttgtattctc (SEQ ID NO: 81) 
caaataaaatttccagccgggtg (SEQ ID NO: 82) 

hMLHl Exon 17 

ggaaaggcactggagaaatggg (SEQ ID NO: 83) 

atttgtttaaactatgacagcattatttcttgttcccttgtcctttttcctgcaagcag 
20 (SEQ ID NO: 84) 

GAAGGGAACCTGATTGGATTACCCCTTCTGATTGACAACTATGTGCCCCCTTTGGAGGG 
ACTGCCTATCTTCATTCTTCGACTAGCCACTGAG (SEQ ID NO: 85) 

gtcagtgatcaagcagatactaagcattt (SEQ ID NO: 86) 
cggtacatgcatgtgtgctggaggg (SEQ ID NO: 87) 

25 hMLHl Exon 18 

taagtagtctgtgatctccg (SEQ ID NO: 88) 

tttagaatgagaatgtttaaattcgtacctattttgaggtattgaatttctttggaccag 
(SEQ ID NO: 89) 

GTGAATTGGGACGAAGAAAAGGAATGTTTTGAAAGCCTCAGTAAAGAATGCGCTATGTT 
3 0 CTATTCCATCCGGAAGCAGTACATATCTGAGGAGTCGACCCTCTCAGGCCAGCAG ( SEQ 
ID NO:90) 
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gtacagtggtgatgcacactggcaccccaggacta (SEQ ID NO: 91) 
ggaraaaar.r-.t-.ratacat (SEQ ID NO: 92) 

hMLHl Exon 19 

ganaricaatat.atattgq (SEQ ID NO: 93) 
5 gatgcaaacagggaggcttatgacatctaatgtgttttccag (SEQ ID NO: 94) 

AGTGAAGTGCCTGGCTCCATTCCAAACTCCTGGAAGTGGACTGTGGAACACATTGTC 
TATAAAGCCTTGCGCTCACACATTCTGCCTCCTAAACATTTCACAGAAGATGGAAATATC 
CTGCAGCTTGCTAACCTGCCTGATCTATACAAAGTCTTTGAGAGGTGTTAA (SEQ ID 

NO: 95) 

10 atatggttatttatgcactgt (SEQ ID NO: 96) 
aaaahatar^nl-.l-.ctttctc (SEQ ID NO: 97) 

tgtattccgatacaaagtgttgtatcaaagtgtgatatacaaagtgtaccaacataagtg 
(SEQ ID NO: 98) 

Elucidation of intron/exon boundary sequences revealed 
15 that hMSH2 is encoded by 16 coding axons. The hiyiSH2 gene 
sequence was determined by PGR. 

The intron/exon structure of the hMSH2 is shown below. 
Positions of introns that interrupt the hMSH2 cDNA are shown. 
Exonic sequence is presented in upper case and intronic 
20 sequence in lower case letters. Exons are numbered from the 
5' end of the cDNA sequence. 



hMSH2 Exon 1 

ggcgggaaacagcttagtgggtgtggggtcg (SEQ ID NO: 99) 

nacattttrttcaaccagqa (SEQ ID NO: 100) 

25 ggtgaggaggtttcgac (SEQ ID NO: 101) 

ATGGCGGTGCAGCCGAAGGAGACGCTGCAGTTGGAGAGCGCGGCCGAGGTCGGCTTCGTG 

CGCTTCTTTCAGGGCATGCCGGAGAAGCCGACCACCACAGTGCGCCTTTTCGACCGGGG 
CGACTTCTATACGGCGCACGGCGAGGACGCGCTGCTGGCCGCCCGGGAGGTGTTCAAGA 
CCCAGGGGGTGATCAAGTACATGGGGCCGGCAG (SEQ ID NO: 102) 
30 gtgagggccgggac (SEQ ID NO: 103) 

aacacatantaagqaqq (SEQ ID NO: 104) 
gac 



DEX-0054 



- 10 - 



PATENT 



hMSH2 Exon 2 

gaa 

gtccagctaatacagtgcttg (SEQ ID NO:105) 

aacatgtaatatctcaaatctgtaatgtactttttttttttttaag (SEQ ID NO: 106) 
5 GAGCAAAGAATCTGCAGAGTGTTGTGCTTAGTAAAATGAATTTTGAATCTTTTGTAAAA 
GATCTTCTTCTGGTTCGTCAGTATAGAGTTGAAGTTTATAAGAATAGAGCTGGAAATAAG 
GCATCCAAGGAGAATGATTGGTATTTGGCATATAAG (SEQ ID NO: 107) 

gtaattatcttcctttttaatttacttattttt (SEQ ID NO: 108) 
ttaagagtagaaaaataaaaatgtg (SEQ ID NO: 109) 
10 aag 

hMSH2 Exon 3 

ATTAATAAGGtTCATAGAGTTTGGATTTTTCCtTTTtgc (SEQ ID NO: 110) 

ttataaaattttaaagtatgttcaag (SEQ ID NO: ill) 
agtttgttaaatttttaaaattttatttttacttag (SEQ ID NO: 112) 

15 GCTTCTCCTGGCAATCTCTCTCAGTTTGAAGACATTCTCTTTGGTAACAATGAT 
ATGTCAGCTTCCATTGGTGTTGTGGGTGTTAAAATGTCCGCAGTTGATGGCCAGAGACAG 
GTTGGAGTTGGGTATGTGGATTCCATACAGAGGAAACTAGGACTGTGTGAATTCCCTGAT 
AATGATCAGTTCTCCAATCTTGAGGCTCTCCTCATCCAGATTGGACCAAAGGAATGTGT 
TTTACCCGGAGGAGAGACTGCTGGAGACATGGGGAAACTGAGACAG (SEQ ID NO: 113) 

20 gtaagcaaattgagtctagtgat (SEQ ID NO: 114) 

agaggagattccaggccta g gaaa g (SEQ ID NO: 115) 
gc 

TCTTTAATTGACATGATACTG (SEQ ID NO: 116) 

hMSH2 Exon 4 

25 ttca 

tttttgcttttcttattccttttc (SEQ ID NO: 117) 
tcatagtagtttaaactatttctttcaaaatag (SEQ ID NO: 118) 
ATAATTCAAAGAGGAGGAATTCTGATCACAGAAAGAAAAAAAGCTGACTTTTCCACAAA 
AGACATTTATCAGGACCTCAACCGGTTGTTGAAAGGCAAAAAGGGAGAGCAGATGAATA 
3 0 GTGCTGTATTGCCAGAAATGGAGAATCAG (SEQ ID NO: 119) 
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gtacatggattataaatgtgaattacaatatatataatgtaaatatgtaatatataata 
aataatatgtaaactatagtgacttt (SEQ ID NO: 120) 
ttagaaggatatttctgtca {SEQ ID NO: 121) 
tat 

5 llMSH2 Exon 5 

actggcacca (SEQ ID NO: 122) 
gtggtatagaaatcttcgattttt (SEQ ID NO: 123) 
aaattcttaattttag (SEQ ID NO: 124) 

GTTGCAGTTTCATCACTGTCTGCGGTAATCAAGTTTTTAGAACTCTTATCAGATGATTC 
10 CAACTTTGGACAGTTTGAACTGACTACTTTTGACTTCAGCCAGTATATGAAATTGGATA 
TTGCAGCAGTCAGAGCCCTTAACCTTTTTCAG (SEQ ID NO: 125) 
gtaaaaaaaaaaaaaaaaaaaaa (SEQ ID NO: 126) 
aaaagggttaaaaatgttgatt (SEQ ID NO: 127) 

gg 

15 TTAAAAAATGTTT (SEQ ID NO: 12 8) 
t 

caTTGACATATAGTGAAGAAGCT (SEQ ID NO:129) 
TATAAAGGAGCTAAAATATTTGGAAAT (SEQ ID NO: 13 0) 
att 

2 0 ATTATACTTGGATTAGATAACTAGCTTTAAATGGGTGTATTTT (SEQ ID NO; 131) 

hMSH2 Exon 6 

gtt 

ttcactaatgagcttgccattc (SEQ ID NO: 132) 
tttctattttattttttgtttactag (SEQ ID NO: 133) 

25 GGTTCTGTTGAAGATACCACTGGCTCTCAGTCTCTGGCTGCCTTGCTGAATAAGTGTAA 
AACCCCTCAAGGACAAAGACTTGTTAACCAGTGGATTAAGCAGCCTCTCATGGATAAGA 
ACAGAATAGAGGAGAG (SEQ ID NO: 134) 

gtatgttattagtttatactttcgttagttttatgtaacctgca (SEQ ID NO: 135) 
gttacccacatgattatacc (SEQ ID NO: 13 6) 

3 0 ac 
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hMSH2 Exon 7 

ga 

cttacgtgcttagttgataa (SEQ ID NO: 13 7) 

attttaattttatactaaaatattttacattaattcaagttaatttatttcag (SEQ ID 
5 NO: 13 8) 

ATTGAATTTAGTGGAAGCTTTTGTAGAAGATGCAGAATTGAGGCAGACTTTACAAGAAG 
ATTTACTTCGTCGATTCCCAGATCTTAACCGACTTGCCAAGAAGTTTCAAAGACAAGCA 
GCAAACTTACAAGATTGTTACCGACTCTATCAGGGTATAAATCAACTACCTAATGTTAT 
ACAGGCTCTGGAAAAACATGAAG (SEQ ID NO: 13 9) 
10 gtaacaagtgattttgtttttttg (SEQ ID NO: 140) 
ttttccttcaactcatacaatata (SEQ ID NO: 141) 
tac 

hMSH2 Exon 8 

ga 

15 tttgtattctgtaaaatgagatcttt (SEQ ID NO:142) 

ttatttgtttgttttactactttcttttag (SEQ ID NO; 143) 

GAAAACACCAGAAATTATTGTTGGCAGTTTTTGTGACTCCTCTTACTGATCTTCGTTCT 
GACTTCTCCAAGTTTCAGGAAATGATAGAAACAACTTTAGATATGGATCAG ( SEQ ID 
NO: 144) 

20 gtatgcaatatactttttaatttaag (SEQ ID NO: 145) 
cagtagttatttttaaaaagcaaag (SEQ ID NO: 146) 
gcc 

hMSH2 Exon 9 

gt 

25 ctttacccattatttataggatt (SEQ ID N0:147) 
ttgtcactttgttctgtttgcag (SEQ ID NO: 148) 

GTGGAAAACCATGAATTCCTTGTAAAACCTTCATTTGATCCTAATCTCAGTGAA 
TTAAGAGAAATAATGAATGACTTGGAAAAGAAGATGCAGTCAACATTAATAAGTGCAGC 
CAGAGATCTTG (SEQ ID NO: 14 9) 
30 gtaagaatgggtcattggag (SEQ ID NO: 150) 

gttggaataattcttttgtctat (SEQ ID NO: 151) 
ac 
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hMSH2 Exon 10 

gg 

tagtaggtatttatggaatactttt (SEQ ID NO: 152) 
tcttttcttcttgtttatcaag (SEQ ID NO: 153) 
5 GCTTGGACCCTGGCAAACAGATTAAACTGGATTCCAGTGCACAGTTTGGATATTACTTTC 
GTGTAACCTGTAAGGAAGAAAAAGTCCTTCGTAACAATAAAAACTTTAGTACTGTAGATA 
TCCAGAAGAATGGTGTTAAATTTACCAACAG (SEQ ID NO: 154) 
gtttgtaagtcattattatatttttaaccctttatt (SEQ ID NO: 155) 
aattccctaaatgctctaaca (SEQ ID NO:156) 
10 tg 

hMSH2 Exon 11 

ca 

cattgcttctagtacacattt (SEQ ID NO:157) 
taatatttttaataaaactgttatttcgatttgcag (SEQ ID NO: 158) 
1 5 CAAATTGACTTCTTTAAATGAAGAGTATACCAAAAATAAAACAGAAT ATGAAGAAGCCC 
AGGATGCCATTGTTAAAGAAATTGTCAATATTTCTTCAG (SEQ ID NO: 159) 
gtaaacttaatagaactaa (SEQ ID NO: 160) 
taatgttctgaatgtcacctg (SEQ ID N0:l6l) 

g 

2 0 hMSH2 Exon 12 

at 

tcagtattcctgtgtacattt (SEQ ID NO:162) 
tctgtttttatttttatacag (SEQ ID NO: 163) 

GCTATGTAGAACCAATGCAGACACTCAATGATGTGTTAGCTCAGCTAGATGCTGTTGTC 
25 AGCTTTGCTCACGTGTCAAATGGAGCACCTGTTCCATATGTACGACCAGCCATTTTGGAGAA 

AGGACAAGGAAGAATTATATTAAAAGCATCCAGGCATGCTTGTGTTGAAGTTCAAGATG 

AAATTGCATTTATTCCTAATGACGTATACTTTGAAAAAGATAAACAGATGTTCCACATC 

ATTACTG (SEQ ID NO: 164) 

gtaaaaaacctggttt (SEQ ID NO: 165) 
30 ttgggctttgtgggggtaa (SEQ ID NO:166) 

eg 
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hMSH2 Exon 13 

eg 

cgattaatcatcagtgtac (SEQ ID NO: 167) 

agtttaggactaacaatccatttattagtagcagaaagaagtttaaaatcttgctttct 
5 gatataatttgttttgtag (SEQ ID NO: 168) 

GCCCCAATATGGGAGGTAAATCAACATATATTCGACAAACTGGGGTGATAGTACT 
CATGGCCCAAATTGGGTGTTTTGTGCCATGTGAGTCAGCAGAAGTGTCCATTGTGGACTG 
CATCTTAGCCCGAGTAGGGGCTGGTGACAGTCAATTGAAAGGAGTCTCCACGTTCATGGC 
TGAAATGTTGGAAACTGCTTCTATCCTCAG (SEQ ID NO: 16 9) 
10 gtaagtgcatctcctagtccctt (SEQ ID NO: 170) 
gaagatagaaatgtatgtctctg (SEQ ID N0:17l) 
tec 

liMSH2 Exon 14 

ta 

15 ccacattttatgtgatgggaa (SEQ ID NO: 172) 

atttcatgtaattatgtgcttcag (SEQ ID NO: 173) 

GTCTGCAACCAAAGATTCATTAATAATCATAGATGAATTGGGAAGAGGAACTTCTACCTA 
CGATGGATTTGGGTTAGCATGGGCTATATCAGAATACATTGCAACAAAGATTGGTGCTTT 
TTGCATGTTTGCAACCCATTTTCATGAACTTACTGCCTTGGCCAATCAGATACCAACTGT 
2 0 TAATAATCTACATGTCACAGCACTCACCACTGAAGAGACCTTAACTATGCTTTATCAGGT 
GAAGAAAG (SEQ ID NO: 174) 
gtatgtactattggagtactctaaattcagaacttg 
gtaatgggaaacttactacc (SEQ ID NO: 175) 
cc 

2 5 hMSH2 Exon 15 

ct 

cttctcatgctgtcccctc (SEQ ID NO: 176) 
acgcttccccaaatttcttatag (SEQ ID NO: 177) 

GTGTCTGTGATCAAAGTTTTGGGATTCATGTTGCAGAGCTTGCTAATTTCCCTAAGCAT 

3 0 GTAATAGAGTGTGCTAAACAGAAAGCCCTGGAACTTGAGGAGTTTCAGTATATTGGAGA 

ATCGCAAGGATATGATATCATGGAACCAGCAGCAAAGAAGTGCTATCTGGAAAGAGAG 
(SEQ ID NO:178) 

gtttgtcagtttgtttt (SEQ ID NO: 179) 
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catagtttaacttagcttctc (SEQ ID NO:180) 
tat 

hMSH2 Exon 16 

ta 

5 attactcatgggacattcaca (SEQ ID NO: 181) 
tgtgtttcag (SEQ ID NO: 182) 

CAAGGTGAAAAAATTATTCAGGAGTTCCTGTCCAAGGTGAAACAAATGCCCTTTAC 
TGAAATGTCAGAAGAAAACATCACAATAAAGTTAAAACAGCTAAAAGCTGAAGTAATAGC 
AAAGAATAATAGCTTTGTAAATGAAATCATTTCACGAATAAAAGTTACTACGTGA ( SEQ 
10 ID NO:183) 
aaa 

atcccagtaatggaa tgaag (SEQ ID NO: 184) 
gta 

hMLHl and liMSH2 genes were sequenced in 5 0 cancer 
15 patients (age of onset <30) and 26 random anonymous donors. 
Initial genomic sequencing detected 12 germline mutations in 
12 patients (24%) . Five mutations were found in hMLHl^ and 
7 in hMSH2 . Using a combination of genomic sequencing and in 
vitro synthesized-protein- truncation assay (IVSP) ^ a total of 
20 15 germ- line mutations were identified. The mutations are 
described in Table 1 . 



Table 1: Pathogenic hMLHl and hMSH2 Mutations Identified in 
Young Colorectal Cancer Probands 



Gene 
and 

Patient 


Mutation 


Nucleotide 
Change 


Effect on 

Coding 
Sequence 


Location 


hMLHl : 


329 


GlGdelAAG 


Deletion of 
AAG at 1846- 
1848 


Deletion of 
Lys616 


Exon 1 6 


533 


IVS8- 
SdelTA 


Deletion of 
TA at 677-3 


Splice 
mutation 


IVS 8 
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696 


K618A 


AA-^GC at 
1852-1853 


Lys-*Ala at 
618 


Exon 1 6 


804 


R659X 


C-T at 1975 


Arg-^Stop at 
659 


Exon 17 


815 


IVSl+lG-A 


G-A at 116+1 


Splice 
TTin t" t" "i OTi 


IVS 1 


817 


del exon 

-L J 


Deletion of 

involving IVS 
12 through 
exon 13 to 
IVS 13 


Deletion of 
codons 47 0- 
520 (exon 
13) 


IVS 12- 
13 . exon 
13 


889 




not 

identified 


Truncation 
of IVSP 


Exons 
12-19 


hMSH2 : 


528 


R406X 


C-T at 1216 


Arg->Stop at 
406 


Exon 7 


579 


H639Y 
IVS13-1G-T 


C-T at 1915 
G-T at 2211 


Double 
mutation 
results in 
deletion of 
codons 58 8- 
82 0 (exons 
12-14) 


Exon 12, 
IVS 13 


814 


Q601X 


C-T at 1801 


Gin-Stop at 
601 


Exon 12 


818 


Q252X 


C-T at 754 


Gin-Stop at 
252 


Exon 4 


825 


delCTGT 


Deletion of 
CTGT at 808- 
811 


Deletion of 
codons 2 65- 
314 (exon 5) 


Exon 5 


830 


R680X 


C-T at 2038 


Arg-^Stop at 
680 


Exon 13 


1157 


MIL 


A-T at 1 


New 

initiation 
at codon 26 


Exon 1 



''IVS" means intervening sequence. 



15 Two of the mutations identified in Table 1 for hMLHl and three 
of the mutations identified in Table 1 for hMSH2 are believed 
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to be new. For hMLHl, these include: the splice mutation IVSI 
+ IG-A in patient 815, also referred to herein as "hMLHl 
mutant 1"; and deletion of exon 13 in patient 817, also 
referred to herein as ''hMLHl mutant 2". For hMSH2, these 
5 include the double mutation H63 9Y IVS13-1G-T leading to 
deletion of codons 588-820 in patient 579, also referred to 
herein as "hMSH2 mutant 1", mutation R680X in patient 830 
wliich comprises a nucleotide change from C to T at position 
2038 in Exon 13 and results in a stop codon at position 680 

10 of the coding sequence, also referred to herein as "hMSH2 
mutant 2"; and mutation MIL in patient 1157 which comprises 
a nucleotide change from A to T at position 1 resulting in a 
new initiation at codon 26, also referred to herein as "hMSH2 
mutant 3". Detection of these genetic mutations is useful in 

15 diagnosing HNPCC in a patient and determining susceptibility 
of a patient for developing HNPCC. 

There are several methodologies available from 
recombinant DNA technology which may be used for detecting 
these new variants and identifying additional genetic 

20 mutations responsible for colon cancer. The identification 
of intronic sequences of hMLHl and h]yiSH2 provided herein is 
particularly useful for design of intronic such as those 
exemplified in SEQ ID N0:1, 5, 7, 11, 12, 16, 17, 21, 23, 27, 
28, 32, 33, 37, 38, 42, 43, 47, 48, 52, 53, 57, 58, 62, 63, 

25 67, 68, 72, 73, 77, 78, 82, 83, 87, 88, 92, 93, 97, 100, 104, 
105, 109, 111, 115, 117, 121, 123, 121, 123, 127, 129, 132, 
136, 137, 141, 142, 146, 147, 151, 152, 156, 157, 161, 162, 
166, 167, 171, 172, 175, 176, 180, 181 and 184 for use in 
identifying mutants in the splice donor or acceptor sites of 

3 0 the hMLHl or hMSH2 gene. Examples of methodologies useful in 
detecting and identifying new variants of these genes include, 
but are not limited to, direct probing, ligase chain reaction 
(LCR) and polymerase chain reaction (PGR) methodology. 

Detection of variants or mutants using direct probing 

35 involves the use of oligonucleotide probes which may be 
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prepared synthetically or by nick translation. In a preferred 
etiobodiment , the probes are complementary to at least a portion 
of the variant hMLHl or hMSH2 genes identified herein. The 
DNA probes may be suitably labeled using, for example, a 
5 radiolabel, enzyme label, fluorescent labels or biotin-avidin 
label, for subsequent visualization in for example a Southern 
blot hybridization procedure. The labeled probe is reacted 
with a sample of DMA from a patients suspected of having HNPCC 
bound to nitrocellulose or Nylon 66 substrate. The areas that 

10 carry DNA sequences complementary to the labeled DNA probe 
become labeled themselves as a consequence of the reannealing 
reaction. The areas of the filter that exhibit such labeling 
may then be visualized, for example, by autoradiography. 

Alternative probe techniques, such as ligase chain 

15 reaction (LCR) involve the use of a mismatch probe, i.e., 
probes which have full complementarity with the target except 
at the point of the mutation or variation. The target 
sequence is then allowed to hybridize both with the 
oligonucleotides having full complementarity, i.e., 

2 0 oligonucleotides complementary to the hMLHl or hMSH2 variants 

of the present invention, and oligonucleotides containing a 
mismatch under conditions which will distinguish between the 
two. By manipulating the reaction conditions, it is possible 
to obtain hybridization only where there is full 
25 complementarity. If a mismatch is present, then there is 
significantly reduced hybridization. 

The polymerase chain reaction (PGR) is a technique that 
amplifies specific DNA sequences. Repeated cycles of 
denaturation, primer annealing and extension carried out with 

3 0 a heat stable enzyme Taq polymerase leads to exponential 

increases in the concentration of desired DNA sequences . 

Given the knowledge of nucleotide sequences encoding the 
hMLHl and hMSH2 genes, it is possible to prepare synthetic 
oligonucleotides complementary to the sequences which flank 
35 the DNA of interest. Each oligonucleotide is complementary 
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to one of the two strands. The DMA is then denatured at high 
temperatures (e^gr., 95°C) and then reannealed in the presence 
of a large molar excess of oligonucleotides. The 
oligonucleotides, oriented with their 3' ends pointing towards 
5 each other, hybridize to opposite strands of the target 
sequence and prime enzymatic extension along the nucleic acid 
template in the presence of the four deoxyribonucleotide 
triphosphates. The end product is then denatured again for 
another cycle. After this three-step cycle has been repeated 

10 several times, amplification of a DNA segment by more than one 
million fold can be achieved. The resulting DNA may then be 
directly sequenced in order to locate any genetic alterations. 
Alternatively, the identified hMLHl and hMSH2 variants of the 
present invention make it possible to prepare oligonucleotides 

15 that will only bind to altered DNA, so that PGR will only 
result in the multiplication of the DNA if the mutation is 
present. Following PGR, allele-specif ic oligonucleotide 
hybridization may be used to detect the colon cancer point 
mutation. 

2 0 Alternatively, an adaptation of PGR called amplification 

of specific alleles (PASA) can be employed; this method uses 
differential amplification for rapid and reliable distinction 
between alleles that differ at a single base pair. Newton et 
al. Nucleic Acid Res. 1989 17:2503; Nichols et al . Genomics 

25 1989 5:535; Okayama et al . J. Lab. Glin. Med. 1989 1214:105; 
Sarkar et al . Anal. Biochem, 1990 186:64; Sommer et al . Mayo 
Glin. Proc. 1989 64:1361; Wu, Proc . Nat " 1 Acad. Sci. USA 1989 
86:2757; and Dutton et al . Biotechniques 1991 11:700. PASA 
involves amplification with two oligonucleotide primers such 

30 that one is allele specific. The desired allele is 
efficiently amplified, while the other allele (s) is poorly 
amplified because it mismatches with a base at or near the 3' 
end of the allele specific primer. Thus, PASA or the related 
method PAMSA can be used to specifically amplify one or more 

35 mutant hMLHl or hMSH2 alleles. Where such amplification is 
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performed on genetic material obtained from a patient, it can 
serve as a method of detecting the presence of one or more 
mutant hMLHl and/or hMSH2 alleles in a patient. PCR-induced 
mutation restriction analysis, often referred to as IMRA, can 
5 also be used in the detection of mutants. 

Also important is the development of experimental models 
of HNPCC. Such models can be used to screen for agents that 
alter the degenerative course of HNPCC. Having identified 
specific mutations in the hMLHl and hMSH2 genes as a cause of 

10 HNPCC, it is possible using genetic manipulation, to develop 
transgenic model systems and/or whole cell systems containing 
a mutated hMLHl and/or hMSH2 gene or a portion thereof. The 
model systems can be used for screening drugs and evaluating 
the efficacy of drugs in treating HNPCC. In addition, these 

15 model systems provide a tool for defining the underlying 
biochemistry of hMLHl and hMSH2 and their relationship to 
HNPCC, thereby providing a basis for rational drug design. 

One type of cell system which can be used in the present 
invention can be naturally derived. For this, blood samples 

20 from an affected individual are obtained and permanently 
transformed into a lymphoblastoid cell line using, for 
example, Epstein-Barr virus. Once established, such cell 
lines can be grown continuously in suspension cultures and can 
be used in a variety of in vitro experiments to study hMLHl 

25 and hMSH2 expression and processing. Another cell line used 
in these studies comprises skin fibroblasts derived from 
patients , 

The mutated gene can also be excised for use in the 
creation of transgenic animals containing the mutated gene. 

3 0 For example, the hMLHl and hMSH2 variants of the present 
invention can each be cloned and placed in a cloning vector. 
Examples of cloning vectors which can be used include, but are 
not limited to, lCharon35, cosmid, or yeast artificial 
chromosome. The variant hMLHl or hMSH2 gene can then be 

35 transferred to a host nonhuman knockout animal such as a 
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knockout mouse. As a result of the transfer, the resultant 
transgenic nonhuman animal will preferably express one or more 
of the variant hMLHl or hMSH2 polypeptides. 

Alternatively, minigenes encoding variant hMLHl or hMSH2 
5 polypeptides can be designed. Such minigenes may contain a 
cDNA sequence encoding a variant hMLHl or hMSH2 polypeptide, 
preferably full-length, a combination of hMLHl or hMSH2 exons, 
or a combination thereof, linked to a downstream 
polyadenylation signal sequence and an upstream promoter (and 

10 preferably enhancer). Such a minigene construct will, when 
introduced into an appropriate transgenic host, such as a 
mouse or rat, express a variant hMLHl or hMSH2 polypeptide. 

One approach to creating transgenic animals is to target 
a mutation to the desired gene by homologous recombination in 

15 an embryonic stem (ES) cell in vitro followed by 
microinjection of the modified ES cell line into a host 
blastocyst and subsequent incubation in a foster mother. 
Frohman et al. Cell 1989 56:145. Alternatively, the technique 
of microinjection of the mutated gene, or portion thereof, 

20 into a one-cell embryo followed by incubation in a foster 
mother can be used. Additional methods for producing 
transgenic animals are well known in the art. 

Transgenic animals are used in the assessment of new 
therapeutic compositions and in carcinogenicity testing, as 

25 exemplified by U.S. Patent 5,223,610. These animals are also 
used in the development of predictive animal models for human 
disease states, as exemplified in U.S. Patent 5,221,778. 
Therefore, the novel mutations of the hMLHl and hMSH2 genes 
of the present invention, which are believed to cause HNPCC, 

3 0 provide a useful means for developing knockout transgenic 
animals to assess this disease. 

Site directed mutagenesis and/or gene conversion can 
also be used to a mutate a non human hMLHl or hMSH2 gene 
allele, either endogenously or via transf ection, such that the 



DEX-0054 - 22 - PATENT 

mutated gene encodes a polypeptide with an altered amino acid 
as described in the present invention. 

In addition, antibodies to the hMLHl or hMSH2 gene and 
variants thereof can be raised for use in the examination of 
5 the function of the truncated transcripts of the hMLHl or 
hMSH2 gene. These antibodies can be, for example, polyclonal 
or monoclonal antibodies. The present invention also includes 
chimeric, single chain, and humanized antibodies, as well as 
Fab fragments, or the product of an Fab expression library. 

10 Various procedures known in the art may be used for the 
production of such antibodies and fragments. 

Antibodies generated against the hMLHl and hMSH2 genes 
of the present invention can be obtained by direct injection 
into an animal or by administering the gene to an animal, 

15 preferably a nonhuman. The antibody so obtained will then 
bind the hMLHl or hMSH2 gene or itself. In this manner, even 
a fragment of the gene can be used to generate thee 
antibodies . 

For preparation of monoclonal antibodies, any technique 
20 which provides antibodies produced by continuous cell line 

cultures can be used. Examples include the hybridoma 

technique (Kohler et al . Nature 1975 256:495-497), the trioma 

technique, the human B-cell hybridoma technique (Kozbor et al . 

Immunology Today 1983 4:72), and the EBV-hybridoma technique 
25 to produce human monoclonal antibodies (Cole et al. in 

Monoclonal Antibodies and Cancer Therapy f Alan R. Liss, Inc., 

1985, pp. 77-96) . 

Techniques described for the production of single chain 

antibodies (U.S. Patent 4,946,778) can be adapted to produce 
3 0 single chain antibodies to the hMLHl or hMSH2 genes of this 

invention. Also, transgenic mice may be used to express 

humanized antibodies to the hMLHl or hMSH2 genes of this 

invention. 

The following nonlimiting examples are provided to 
35 further illustrate the present invention. 
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EXAMPLES 

Example 1: Patients and Samples 

A total of 76 subjects were studied: 50 unrelated 
patients diagnosed with colorectal cancer at <3 0 years of age 
5 and 26 anonymous donors. There were 15 male and 11 female 
anonymous donors who were cancer free at the time of sampling 
and whose mean ages was 41 years. None of the study subjects 
were referred specifically because of a family history of 
colon cancer. All cancer patients had histologically 
10 confirmed colorectal cancer. 

Peripheral blood was drawn from each subject and DNA was 
purified from peripheral-blood leukocytes. 

Example 2 : Genomic Sequencing 

15 DNA was extracted from peripheral blood using the 

Nucleon DNA Extraction Kit, Scotlab, Lanarkshire, U.K. or 
using the Puregene DNA Isolation Kit (Centra Systems, 
Minneapolis, MN) as per the manufacturer's instructions. 
Each exon of hMSH2 and hMLHl was amplified by PGR using 4 0 

20 ng of genomic DNA in a volume of 50 fih. Final reaction 

concentrations were 1 x PGR Buffer II (Perkin Elmer), 3.0 
mM MgCl2 (or 1.5 mM for hMSH2 exon 1), 0,2 mM dNTPs, 10 pmol 
of each specific oligonucleotide primer, and 1.25 units of 
Taq polymerase. Amplification was hot-started at for 3 

25 minutes, followed by 35 cycles of 94°C for 20 seconds; SS^'C 
for 20 seconds; 72°C for 40 seconds. The final reaction was 
extended at 72°G for 10 minutes, followed by storage at 4''C, 
Cycle sequencing used the PRISM Ready Dye Terminator Cycle 
Sequencing kit with AmpliTaq DNA polymerase, FS (Taq-FS; 

30 Perkin Elmer/Applied Biosystems) and an Applied Biosystems 
DNA Sequencer model 373A or 377 (Parker et al . 
BioTechniques 1996 21:694-699) according to the 
manufacturer's instructions. DNA sequence analysis was 
performed using Sequencher 3.0 (Gene Codes, Ann Arbor, MI) 

3 5 software by comparing published genomic sequences of hMLHl 
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(Han et al , Hum, Mol . Genet. 1995 4:237-242; Kolodner et 
al. Cancer Res, 1995 55:242-248) and hMSH2 (Kolodner, et 
al . Genomics 1994 24:516-526) with that of cancer cases or 
of random donors . 



5 Examples of primers used for mutations in patients 815, 83 0 
and 1157 are as follows: 

(1) Patient 815, splice error in hMLHl exon 1: 
Forward primer: 

5 ' -TGTAAAACGACGGCCAGTCTGAGGTGATTGGCTGAAG-3 ' (SEQ ID NO: 
10 185) 

Reverse primer: 
5 ' -GGAAACAGCTATGACCATGCCGTTAAGTCGTAGCCCTT-3 ' (SEQ ID NO: 
186) 

(2) Patient 830, premature stop codon in hMSH2 exon 

15 13 : 

Forward primer: 
5 ' -TGTAAAACGACGGCCAGTCGATTAATCATCAGTGTAC-3 ' (SEQ ID NO: 
187) 

Reverse primer: 

2 0 5 ' -GGAAACAGCTATGACCATGCAGAGACATACATTTCTATCTTC-3 ' (SEQ ID NO: 
188) 

(3) Patient 1157, missense in initial ATG of hMSH2 
(exon 1) : 

Forward primer: 
2 5 5 ' -TGTAAAACGACGGCCAGTCGCATTTTCTTCAACCAGGA-3 ' (SEQ ID NO: 
189) 

Reverse primer: 
5 ' -GGAAACAGCTATGACCATGCCTCCCCAGCACGCGCC - 3 ' ( SEQ ID NO : 190) 



Example 2: In Vitro Synthesized-Protein-Truncation 

3 0 Assay (IVSP) 

cDNA was generated by reverse transcription of RNA 
purified from lymphoblastoid cell lines from the affected 
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index case. PGR amplification of the cDNA was used to 
introduce a 17 -bp consensus T7 promoter sequence and a 
mammalian translation-initiation sequence in frame with a 
unique hMLHl or hMSH2 sequence. PGR primer sequences and 
5 conditions were similar to those previously described in 
Example l. Each gene was amplified in two or three 
overlapping segments. Resultant PGR products were used in 
a coupled transcription-translation reaction (Promega) 
incorporating 2-5 fxCi. of ^^S-methionine . Labeled in vitro- 

10 transcribed protein products from the reaction were heat 
denatured and were analyzed by use of 8%, 10% and 12% SDS- 
PAGE gels. Gels were washed in fixative and were 
autoradiographed overnight at room temperature. All 
samples showing truncated protein products were reamplified 

15 independently, and an additional IVSP analysis was 

performed for conformation. For each analysis, normal 
control samples were run in parallel, and the wild-type 
full length protein was noted* In most analyses, 
artif actual bands were visible, presumably due to 

20 internally initiations since these were visible in samples 
form normal controls. 



Example 3 : Long Range PCR 

For long range PCR of the novel mutation of hMSH2 
discovered in patient 817, the GeneAmp XL PCR Kit (Perkin 
25 Elmer) was used with the following primers: 

Forward primer: 
5 ' -GGCCATTGTCACAGAGGATAAGA-3 ' (SEQ ID NO : 191) 

Reverse primer: 
5 ' -ACACAGCCCACGAAGGAGTG-3 ' (SEQ ID NO: 192) 
30 The reaction mixture contained about 400 ng of genomic DNA in 
a volume of 50 //L. Final reaction concentrations were 1 x PCR 
Buffer II (Perkin Elmer), 1.5 mM Mg(0Ac)2/ 0,8 mM dNTPs, 40 
pmol of each specific oligonucleotide primer, and 4 units of 
rTth DNA polymerase. Amplification was hot- started at 94'^C 
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for 1 minute, followed by 26 cycles of for 15 second and 

68^C for 10 minutes. The final reaction was extended at 72°C 
for 10 minutes, followed by storage at 4°C. 

Replicate cDNA sequencing of samples from patient 817 
5 reproducibly demonstrated a truncation in hMSH2 due to 
deletion of the entire exon 13. However, extensive genomic 
sequencing failed to identify the mutation at the DNA level. 
Hence, the intronic region around exon 13 was analyzed by long 
range PGR to determine whether any large genomic deletion had 
10 completely removed that exon. Forward primer was in exon 12 
and reverse in exon 14, giving around 15.5 kb wild type 
product. Using this approach, patient 817 was shown to carry 
a large deletion of approximately 3 kb which resulted in 
removal of exon 13 . 

15 Example 4: Characterization of mutation in patient 579 

Characterization of the mutation in patient 579 was more 
complex. replicate hMSH2 IVSPs for patient 579 detected a 
very short protein fragment, which could not be explained on 
the basis of the His-^Tyr mutation at codon 639, identified by 

20 genomic sequencing. Accordingly, additional genomic 

sequencing needed to be performed which resulted in 
identification of the second mutation at the splice acceptor 
site of exon 14. Using restriction-site changes induced by 
each mutation, both variants were traced through the family 

25 and were shown to reside on the same allele. Extensive 
sequencing of the reverse transcription-PCR products revealed 
that this complex double mutation results in an in-frame 
deletion of exons 12-14, thus accounting for the very short 
IVSP fragment. A His-^Tyr mutation at codon 639 which results 

3 0 in a surrogate splice donor site and a 92 -bp frameshift 
deletion of nucleotides 1914-2006, generating a premature 
termination codon 17 bp downstream of the exon 13 splice 
acceptor site has been described previously by Leach et al . 
Cell 1993 75:1215-1225 and Liu et al . Cancer Res. 1994 
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54:4590-4594. However, the 92 bp splice mutation reported to 
be present in this mutation was not present in patient 579, 
thus confirming that the double mutation in patient 579 is 
distinct from that reported by Liu et al. Cancer Res* 1994 
5 54:4590-4594. 

All publications including; but not limited to, patents 
and patent applications, cited in this specification, are 
herein incorporated by reference as if each individual 
publication were specifically and individually indicated to 

10 be incorporated by reference herein as though fully set forth. 

The above description fully discloses the invention, 
including preferred embodiments thereof. Modifications and 
improvements of the embodiments specifically disclosed herein 
are within the scope of the following claims. Without further 

15 elaboration, it is believed that one skilled in the art can, 
using the preceding description, utilize the present invention 
to its fullest extent. Therefore, the examples provided 
herein are to be construed as merely illustrative and are not 
a limitation of the scope of the present invention in any way. 

20 The embodiments of the invention in which an exclusive 
property or privilege is claimed are defined as follows. 
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What is claimed is : 

1. A variant human MLHl or MSH2 gene comprising hMLHl 
mutant 1, hMLHl mutant 2, hMSH2 mutant 1, hMSH2 mutant 2 or 
hMSH2 mutant 3 , 

5 2 . A method of diagnosing hereditary non-polyposis 

colorectal cancer in a patient comprising: 

(a) obtaining a DNA sample from a patient; and 

(b) screening the DNA sample for the variant human MLHl 
or MSH2 gene of claim 1, wherein the presence of the variant 

10 gene is indicative of hereditary non-polyposis colorectal 
cancer . 

3 . A method for predicting susceptibility of a patient 
to developing hereditary non-polyposis colorectal cancer 
comprising: 

15 (a) obtaining a DNA sample from a patient; and 

(b) screening the DNA sample for the variant human MLHl 
or MSH2 gene of claim 1, wherein the presence of the variant 
gene is indicative of a susceptibility to hereditary non- 
polyposis colorectal cancer. 

20 4. A method of identifying mutants in splice donor or 

acceptor sites of a human MLHl gene comprising sequencing 
splice donor or acceptor sites of the human MLHl gene with 
intronic primers for the human MLHl gene and analyzing the 
sequences to identify any mutants. 

25 5 . An intronic primer for human MLHl. 

6 . A method of identifying mutants in splice donor or 
acceptor sites of a human MSH2 gene, comprising sequencing 
splice donor or acceptor sites of the human MSH2 gene with 
intronic primers for the human MSH2 gene and analyzing the 
30 sequences to identify any mutants. 
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7 . An intronic primer for human MSH2 , 

8. A transgenic model system for colorectal cancer 
comprising cells expressing the variant human MLHl or MSH2 
gene of claim 1. 
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ABSTRACT 

Variant human MLHl and MSH2 genes are provided. Methods 
of using these variant genes to diagnose hereditary non- 
polyposis colorectal cancer (HNPCC) and/or determine a 
5 patient's susceptibility to developing HNPCC are also 
provided. Methods and compositions for identifying new 
variant MLHl of MSH2 genes are also provided. In addition, 
experimental models for hereditary non-polyposis colorectal 
cancer comprising these variant genes are provided. 



Express Mail Label No. EL429958222US ^^9® "* °^ ^ 



Docket No. 
DEX-0054 



Declaration and Power of Attorney For Patent Application 

English Language Declaration 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

1 believe I am the original, first and sole inventor (if only one name is listed below) or an original, 
first and joint inventor (if plural names are listed below) of the subject matter which is claimed and for 
which a patent is sought on the invention entitled 

NOVEL MUTATIONS IN HUMAN MLHl AND HUMAN MSH2 GENES USEFUL IN DIAGNOSING 
COLORECTAL CANCER 

the specification of which 

(check one) 

gl is attached hereto. 

□ was filed on as United States Application No. or PCT International 

Application Number 



and was amended on 



(if applicable) 



I hereby state that I have reviewed and understand the contents of the above identified specification, 
including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose to the United States Patent and Trademark Office all information 
known to me to be material to patentability as defined in Title 37, Code of Federal Regulations, 
Section 1 .56. 

I hereby claim foreign priority benefits under Title 35, United States Code, Section 119(a)-(d) or 
Section 365(b) of any foreign application(s) for patent or inventor's certificate, or Section 365(a) of 
any PCT International application which designated at least one country other than the United States, 
listed below and have also identified below, by checking the box, any foreign application for patent or 
inventor's certificate or PCT Intemational application having a filing date before that of the application 
on which priority is claimed. 

Prior Foreign Application(s) Priority Not Claimed 

□ 
□ 

□ 



(Number) (Country) (Day/Month/Year Filed) 



(Number) (Country) (Day/Month/Year Filed) 



(Number) (Country) (Day/MonthA'ear Filed) 



Form PTO-SB-01 (9-95) (Modified) 



P02/REV02 



Patent and Trademark Offlce-U.S. DEPARTMENT OF COIVIMERCE 



Page 2 of 4 



I hereby claim the benefit under 35 U.S.C. Section 119(e) of any United States provisional 
application(s) listed below: 

October 22, 1998 



^ ^' ^ 7 

(Application Serial No.) 


(Filing Date) 


(Application Serial No.) 


(Filing Date) 


(Application Serial No.) 


(Filing Date) 



I hereby claim the benefit under 35 U. S. C. Section 120 of any United States application(s), or 
Section 365(c) of any PCT International application designating the United States, listed below and, 
insofar as the subject matter of each of the claims of this application is not disclosed in the prior 
United States or PCT International application in the manner provided by the first paragraph of 35 
U.S.C. Section 1 12, I acknowledge the duty to disclose to the United States Patent and Trademark 
Office all information known to me to be material to patentability as defined in Title 37, C. F. R., 
Section 1.56 which became available between the filing date of the prior application and the national 
or PCT International filing date of this application: 



(Application Serial No.) (Filing Date) (Status) 

(patented, pending, abandoned) 



(Application Serial No.) (Filing Date) (Status) 

(patented, pending, abandoned) 



(Application Serial No.) (Filing Date) (Status) 

(patented, pending, abandoned) 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such 
willful false statements may jeopardize the validity of the application or any patent issued thereon. 



Form PTO-SB-01 (6-95) (Modified) 



Patent and Trademarit Office-U.S. DEPARTIVIENT OF COIMMERCE 



Page 3 of 4 



POWER OF ATTORNEY: As a named inventor, I hereby appoint the following attorney(s) and/or 
agent(s) to prosecute this application and transact all business in the Patent and Trademark Office 
connected therewith . (list name and registration number) 
Jane Massey Licata, Reg. No. 32,257 

Kathleen A. Tyrrell, Reg. No. 38,350 

Laura Plunkett, Reg. No. P45,015 

of the firm 



Law Offices of Jane Massey Licata 
66 E, Main Street 
Marlton, NJ 08053 



Send Correspondence to: ^^"^ ^^^^^^ 

Law Offices of Jane Massey Licata 

66 E. Main Street 

Marlton, NJ 08053 



Direct Telephone Calls to: (name and telephone number) 
Jane Massey Licata Tel: 856-810-1515 



Full name of sole or first inventor 




David Robbins 




Sole or first inventor's signature 


Date 


Residence 




Stevenson Ranch, California 




Citizenship 




US 




Post Office Address 




26013 Baldwin Place 




Stevenson, Ranch, CA 91381 



Full name of second inventor, if any 




Juili Lillian Lin-Goerke 




Second inventor's signature 


Date 


Residence 




Spring City, Pennsylvania 




Citizenship 




Taiwan 




Post Office Address 




135 Hill Church Road 




Spring City, PA 19475 



Form PTO-SB-01 (6-95) (Modified) 



Patent and Trademark Office-U.S. DEPARTMENT OF COMMERCE 



Page 4 of 4 









Full name of third inventor, if any 
Jessica C. Ling 






Third inventor's signature 


Date 






Residence 

Bensalem, Pennsylvania 






Citizenship 
US 






Post Office Address 
735 Cliff Road 






Bensalem, PA 19020 












Fuli name of fourth inventor, if any 








Fourth inventor's signature 


Date 






Residence 




. 


Citizenship 






Post Office Address 












;: ii 








Full name of fifth inventor, if any 






Fifth inventor's signature 


Date 






Residence 






Citizenship 






Post Office Address 


















Full name of sixth inventor, if any 






Sixth inventor's signature 


Date 






Residence 






Citizenship 






Post Office Address 















Form PTO-SB-01 (6-95) (Modified) ' Patent and Trademark Office-U.S. DEPARTMENT OF COMMERCE 



SEQUENCE LISTING 

<110> Robbins, David 

Lin-Goerke, Juili L. 
Ling^ Jessica 

<120> Novel Mutations in Human MLHl and iyiSH2 Genes Useful in 
Diagnosing Colorectal Cancer 

<130> DEX-0054 

<140> 
<141> 

<160> 192 

<170> Patentin Ver. 2.0 

<210> 1 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 1 

aggcactgag gtgattggc 19 

<210> 2 

<211> 54 

<212> DNA 

<213> Homo sapiens 

<400> 2 

tgaaggcact tccgttgagc atctagacgt ttccttggct cttctggcgc caaa 54 

<210> 3 

<211> 51 

<212> DNA 

<213> Homo sapiens 

<400> 3 

ggaagttatc cagcggccag ctaatgctat caaagagatg attgagaact g 51 

<210> 4 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 4 



1 



gtacggaggg agtcgagccg g 



21 



<210> 5 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gctcacttaa gggctacga 19 

<210> 6 

<211> 8 

<212> DFA 

<213> Homo sapiens 

<400> 6 

cttaacgg 8 

<210> 7 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 7 

aatatgtaca ttagagtagt tg 22 

<210> 8 

<211> 35 

<212> DNA 

<213> Homo sapiens 

<400> 8 

cagactgata aattattttc tgtttgattt gccag 35 

<210> 9 
<211> 91 
<212> DNA 

<213> Homo sapiens 
<400> 9 

tttagatgca aaatccacaa gtattcaagt gattgttaaa gagggaggcc tgaagttgat 60 
tcagatccaa gacaatggca ccgggatcag g 91 

<210> 10 

<211> 46 

<212> DNA 

<213> Homo sapiens 



2 



<400> 10 

gtaagtaaaa cctcaaagta gcaggatgtt tgtgcgcttc atggaa 



46 



<210> 11 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 11 

gagtcaggac ctttctctg 19 

<210> 12 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 12 

agagatttgg aaaatgagta ac 22 

<210> 13 

<211> 34 

<212> DNA 

<213> Homo sapiens 

<400> 13 

atgattattt actcatcttt ttggtatcta acag 34 

<210> 14 

<211> 99 

<212> DNA 

<213> Homo sapiens 

<400> 14 

aaagaagatc tggatattgt atgtgaaagg ttcactacta gtaaactgca gtcctttgag 6 0 
gatttagcca gtatttctac ctatggcttt cgaggtgag 99 

<210> 15 

<211> 33 

<212> DNA 

<213> Homo sapiens 

<400> 15 

gtaagctaaa gattcaagaa atgtgtaaaa tat 3 3 

<210> 16 

<211> 19 

<212> DNA 

<213> Homo sapiens 



3 



<400> 16 

cctcctgtga tgacattgt 



19 



<210> 17 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 17 

aacctttccc tttggtgagg 

<210> 18 

<211> 59 

<212> DNA 

<213> Homo sapiens 

<400> 18 

tgacagtggg tgacccagca gtgagttttt ctttcagtct attttctttt cttccttag 59 

<210> 19 

<211> 74 

<212> DNA 

<213> Homo sapiens 

<400> 19 

gctttggcca gcataagcca tgtggctcat gttactatta caacgaaaac agctgatgga 60 

74 

aagtgtgcat acag 

<210> 20 

<211> 53 

<212> DNA 

<213> Homo sapiens 

<400> 20 

gtatagtgct gacttctttt actcatatat attcattctg aaatgtattt tgg 53 

<210> 21 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 21 

20 



gcctaggtct cagagtaatc 



<210> 22 
<211> 7 
<212> DNA 



4 



<213> Homo sapiens 

<400> 22 
ttgatat 

<210> 23 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 23 

gattttctct tttccccttg gg 

<210> 24 

<211> 51 

<212> DNA 

<213> Homo sapiens 

<400> 24 

attagtatct atctctctac tggatattaa 

<210> 25 

<211> 73 

<212> DNA 

<213> Homo sapiens 

<400> 25 

agcaagttac tcagatggaa aactgaaagc 
gacccagatc acg 

<210> 26 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 26 

gtaagaatgg tacatgggac a 

<210> 27 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 27 

gtaaattgtt gaagctttgt ttg 

<210> 28 
<211> 26 



22 



tttgttatat tttctcatta g 



ccctcctaaa ccatgtgctg gcaatcaagg 60 

73 



21 



5 



<212> DNA 

<213> Homo sapiens 



<400> 28 

gggttttatt ttcaagtact tctatg 26 

<210> 29 

<211> 31 

<212> DMA 

<213> Homo sapiens 

<400> 29 

aatttacaag aaaaatcaat cttctgttca g 31 

<210> 30 

<211> 92 

<212> DNA 

<213> Homo sapiens 

<400> 30 

gtggaggacc ttttttacaa catagccacg aggagaaaag ctttaaaaaa tccaagtgaa 60 
gaatatggga aaattttgga agttgttggc ag 92 

<210> 31 

<211> 61 

<212> DNA 

<213> Homo sapiens 

<400> 31 

gtacagtcca aaatctggga gtgggtctct gagatttgtc atcaaagtaa tgtgttctag 60 
t 61 

<210> 32 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 32 

gctcatacat tgaacagttg ctgagc 2 6 

<210> 33 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 33 

ctagtgtgtg tttttggc 18 



<210> 34 
<211> 35 
<212> DNA 

<213> Homo sapiens 
<400> 34 

aactcttttc ttactctttt gtttttcttt tccag 35 

<210> 35 

<211> 43 

<212> DNA 

<213> Homo sapiens 

<400> 35 

gtattcagta cacaatgcag gcattagttt ctcagttaaa aaa 43 

<210> 36 

<211> 70 

<212> DNA 

<213> Homo sapiens 

<400> 36 

gtaagttctt ggtttatggg ggatggtttt gttttatgaa aagaaaaaag gggattttta 60 
atagtttgct 

<210> 37 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 37 

1 8 

ggtggagata aggttatg 

<210> 38 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 38 

ctcagccatg agacaataaa tec 

<210> 39 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 39 

27 



ttgtgtcttc tgctgtttgt ttatcag 



7 



<210> 40 
<211> 89 
<212> DNA 

<213> Homo sapiens 
<400> 40 

caaggagaga cagtagctga tgttaggaca ctacccaatg cctcaaccgt ggacaatatt 
cgctccatct ttggaaatgc tgttagtcg 

<210> 41 

<2ll> 57 

<212> DNA 

<213> Homo sapiens 

<400> 41 

gtatgtcgat aacctatata aaaaaatctt ttacatttat tatcttggtt tatcatt 

<210> 42 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 42 

ccatcacatt atttgggaac c 

<210> 43 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 43 

caaaagcttc agaatctc 

<210> 44 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 44 
ttttctaata g 

<210> 45 

<211> 113 

<212> DNA 

<213> Homo sapiens 

<400> 45 



8 



agaactgata gaaattggat gtgaggataa aaccctagcc ttcaaaatga atggttacat 6 
atccaatgca aactactcag tgaagaagtg catcttctta ctcttcatca acc 1 



<210> 46 

<211> 28 

<212> DNA 

<213> Homo sapiens 



<400> 46 

gtaagttaaa aagaaccaca tgggaaat 

<210> 47 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 47 

ccactcacag gaaacaccca cag 

<210> 48 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 48 

catgactttg tgtgaatgta cacc 



<210> 49 

<211> 56 

<212> DNA 

<213> Homo sapiens 

<400> 49 

tgtgacctca cccctcagga cagttttgaa ctggttgctt tctttttatt gtttag 56 



<210> 50 

<211> 94 

<212> DNA 

<213> Homo sapiens 



<400> 50 

atcgtctggt agaatcaact tccttgagaa aagccataga aacagtgtat gcagcctatt 6 
tgcccaaaaa cacacaccca ttcctgtacc tcag ^ 

<210> 51 

<211> 41 

<212> DNA 

<213> Homo sapiens 



9 



<400> 51 

gtaatgtagc accaaactcc tcaaccaaga ctcacaagga a 



41 



<210> 52 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 52 

cagatgttct atcaggctct cctc 24 

<210> 53 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 53 

gggctttttc tccccctccc 

<210> 54 

<211> 39 

<212> DNA 

<213> Homo sapiens 

<400> 54 

actatctaag gtaattgttc tctcttattt tcctgacag 3 9 

<210> 55 

<211> 154 

<212> DNA 

<213> Homo sapiens 

<400> 55 

tttagaaatc agtccccaga atgtggatgt taatgtgcac cccacaaagc atgaagttca 60 

cttcctgcac gaggagagca tcctggagcg ggtgcagcag cacatcgaga gcaagctcct 12 0 

gggctccaat tcctccagga tgtacttcac ccag 154 

<210> 56 

<211> 58 

<212> DNA 

<213> Homo sapiens 

<400> 56 

gtcagggcgc ttctcatcca gctacttctc tggggccttt gaaatgtgcc cggccaga 5 8 

<210> 57 
<211> 18 



10 



<212> DNA 

<213> Homo sapiens 

<400> 57 

cgtgagagcc cagatttt 

<210> 58 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 58 

aattatacct catactagc 

<210> 59 

<211> 116 

<212> DNA 

<213> Homo sapiens 

<400> 59 

ttctttctta gtactgctcc atttggggac ctgtatatct atacttctta ttctgagtct 60 
ctccactata tatatatata tatatatatt tttttttttt ttttttttta atacag 116 

<210> 60 

<211> 371 

<212> DNA 

<213> Homo sapiens 

<400> 60 

actttgctac caggacttgc tggcccctct ggggagatgg ttaaatccac aacaagtctg 60 

acctcgtctt ctacttctgg aagtagtgat aaggtctatg cccaccagat ggttcgtaca 12 0 

gattcccggg aacagaagct tgatgcattt ctgcagcctc tgagcaaacc cctgtccagt 180 

cagccccagg ccattgtcac agaggataag acagatattt ctagtggcag ggctaggcag 24 0 

caagatgagg agatgcttga actcccagcc cctgctgaag tggctgccaa aaatcagagc 3 00 

ttggaggggg atacaacaaa ggggacttca gaaatgtcag agaagagagg acctacttcc 3 60 

agcaacccca g •^'^^ 

<210> 61 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 61 

gtatggcctt ttgggaaaag tacagccta 2 9 

<210> 62 
<211> 23 
<212> DNA 



11 



<213> Homo sapiens 



<400> 62 

cctcctttat tctgtaataa aac 23 

<210> 63 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 63 

tgcaacccac aaaatttggc 

<210> 64 

<211> 53 

<212> DNA 

<213> Homo sapiens 

<400> 64 

taagtttaaa aacaagaata ataatgatct gcacttcctt ttcttcattg cag 53 

<210> 65 

<211> 149 

<212> DNA 

<213> Homo sapiens 

<400> 65 

aaagagacat cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaat 60 
gactgcagct tgtacccccc ggagaaggat cattaacctc actagtgttt tgagtctcca 12 0 
ggaagaaatt aatgagcagg gacatgagg 14 9 

<210> 66 

<211> 50 

<212> DNA 

<213> Homo sapiens 

<400> 66 

gtacgtaaac gctgtggcct gcctgggatg catagggcct caactgccaa 50 

<210> 67 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 67 

ggttttggaa atggagaaag 2 0 

<210> 68 



12 



<211> 18 

<212> DNA 

<213> Homo sapiens 



<400> 68 

tggtgtctct agttctgg 

<210> 69 

<211> 82 

<212> DNA 

<213> Homo sapiens 

<400> 69 

tgcctggtgc tttggtcaat gaagtggggt 
ggttttattt tttgttttgc ag 

<210> 70 

<211> 109 

<212> DNA 

<213> Homo sapiens 

<400> 70 

ttctccggga gatgttgcat aaccactcct 
tggcacagca tcaaaccaag ttataccttc 

<210> 71 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 71 

gtaaatcagc tgagtgtgtg aacaa 

<210> 72 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 72 

gcagagctac tacaacaatg 

<210> 73 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 73 

cccatttgtc ccaactgg 



tggtaggatt ctattactta cctgtttttt 60 

82 



tcgtgggctg tgtgaatcct cagtgggcct 6 0 
tcaacaccac caagcttag 10 



25 



20 



13 



<210> 74 

<211> 57 

<212> DNA 

<213> Homo sapiens 

<400> 74 

ttgtatctca agcatgaatt cagcttttcc ttaaagtcac ttcattttta ttttcag 

<210> 75 

<211> 64 

<212> DMA 

<213> Homo sapiens 

<400> 75 

tgaagaactg ttctaccaga tactcattta tgattttgcc aattttggtg ttctcaggtt 
atcg 

<210> 76 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 76 

gtaagtttag atccttttca ctt 

<210> 77 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 77 

ctgacatttc aactgaccg 

<210> 78 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 78 

catttggatg ctccgttaaa gc 

<210> 79 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 79 



14 



ttgctccttc atgttcttgc ttcttcctag 
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<210> 80 

<211> 165 

<212> DNA 

<213> Homo sapiens 

<400> 80 

gagccagcac cgctctttga ccttgccatg cttgccttag atagtccaga gagtggctgg 60 
acagaggaag atggtcccaa agaaggactt gctgaataca ttgttgagtt tctgaagaag 12 0 
aaggctgaga tgcttgcaga ctatttctct ttggaaattg atgag 165 

<210> 81 

<211> 36 

<212> DNA 

<213> Homo sapiens 

<400> 81 

gtgtgacagc cattcttata cttctgttgt attctc 36 

<210> 82 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 82 

caaataaaat ttccagccgg gtg 2 3 

<210> 83 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 83 

ggaaaggcac tggagaaatg gg 

<210> 84 

<211> 59 

<212> DNA 

<213> Homo sapiens 

<4G0> 84 

atttgtttaa actatgacag cattatttct tgttcccttg tcctttttcc tgcaagcag 59 

<210> 85 

<211> 93 

<212> DNA 

<213> Homo sapiens 



<400> 85 

gaagggaacc tgattggatt accccttctg attgacaact atgtgccccc tttggaggga 60 
ctgcctatct tcattcttcg actagccact gag 93 



<210> 86 

<211> 29 

<212> DMA 

<213> Homo sapiens 

<400> 86 

gtcagtgatc aagcagatac taagcattt 

<210> 87 

<211> 25 

<212> DNA 

<213> Homo sapiens 
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<400> 87 

cggtacatgc atgtgtgctg gaggg 2 5 

<210> 88 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 88 

taagtagtct gtgatctccg 2 0 

<210> 89 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 89 

tttagaatga gaatgtttaa attcgtacct attttgaggt attgaatttc tttggaccag 60 

<210> 90 
<211> 114 
<212> DNA 

<213> Homo sapiens 
<400> 90 

gtgaattggg acgaagaaaa ggaatgtttt gaaagcctca gtaaagaatg cgctatgttc 6 
tattccatcc ggaagcagta catatctgag gagtcgaccc tctcaggcca gcag 1 



<210> 91 
<211> 35 



<212> DNA 

<213> Homo sapiens 



<400> 91 

gtacagtggt gatgcacact ggcaccccag gacta 35 

<210> 92 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 92 

ggacaggacc tcatacat 18 

<210> 93 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 93 

gacaccagtg tatgttgg 18 

<210> 94 

<211> 42 

<212> DNA 

<213> Homo sapiens 

<400> 94 

gatgcaaaca gggaggctta tgacatctaa tgtgttttcc ag 42 

<210> 95 
<211> 168 
<212> DNA 

<213> Homo sapiens 
<400> 95 

agtgaagtgc ctggctccat tccaaactcc tggaagtgga ctgtggaaca cattgtctat 60 
aaagccttgc gctcacacat tctgcctcct aaacatttca cagaagatgg aaatatcctg 12 0 
cagcttgcta acctgcctga tctatacaaa gtctttgaga ggtgttaa 168 

<210> 96 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 96 

atatggttat ttatgcactg t 21 
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<210> 97 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 97 

gggatgtgtt cttctttctc 20 

<210> 98 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 98 

tgtattccga tacaaagtgt tgtatcaaag tgtgatatac aaagtgtacc aacataagtg 60 



<210> 99 

<211> 31 

<212> DNA 

<213> Homo sapiens 

<400> 99 

ggcgggaaac agcttagtgg gtgtggggtc g 

<210> 100 
<211> 20 
<212> DNA 
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<213> Homo sapiens 
<400> 100 

cgcattttct tcaaccagga 2 0 

<210> 101 
<211> 17 
<212> DNA 

<213> Homo sapiens 
<400> 101 

ggtgaggagg tttcgac 17 

<210> 102 
<21l> 211 
<212> DNA 
<213> Homo sapiens 

<400> 102 

atggcggtgc agccgaagga gacgctgcag ttggagagcg cggccgaggt cggcttcgtg 6 0 
cgcttctttc agggcatgcc ggagaagccg accaccacag tgcgcctttt cgaccggggc 12 0 

18 



gacttctata cggcgcacgg cgaggacgcg ctgctggccg cccgggaggt gttcaagacc 18 0 
cagggggtga tcaagtacat ggggccggca g 211 

<210> 103 

<211> 14 

<212> DNA 

<213> Homo sapiens 

<400> 103 

gtgagggccg ggac 14 

<210> 104 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 104 

ggcgcgtgct ggggagg 17 

<210> 105 

<211> 21 

<212> DMA 

<213> Homo sapiens 

<400> 105 

gtccagctaa tacagtgctt g 21 

<210> 106 

<211> 46 

<212> DNA 

<213> Homo sapiens 

<400> 106 

aacatgtaat atctcaaatc tgtaatgtac tttttttttt tttaag 46 

<210> 107 
<211> 155 
<212> DNA 
<213> Homo sapiens 

<400> 107 

gagcaaagaa tctgcagagt gttgtgctta gtaaaatgaa ttttgaatct tttgtaaaag 60 

atcttcttct ggttcgtcag tatagagttg aagtttataa gaatagagct ggaaataagg 12 0 

catccaagga gaatgattgg tatttggcat ataag 155 

<210> 108 
<211> 33 
<212> DNA 
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<213> Homo sapiens 



<400> 108 

gtaattatct tcctttttaa tttacttatt ttt 33 

<210> 109 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 109 

ttaagagtag aaaaataaaa atgtg 25 

<210> 110 

<211> 32 

<212> DNA 

<213> Homo sapiens 

<400> 110 

attaataagg ttcatagagt ttggattttt cc 32 

<210> 111 
<211> 26 
<212> DNA 

<213> Homo sapiens 
<400> 111 

ttataaaatt ttaaagtatg ttcaag 26 

<210> 112 

<211> 36 

<212> DNA 

<213> Homo sapiens 

<400> 112 

agtttgttaa atttttaaaa ttttattttt acttag 36 

<210> 113 
<211> 279 
<212> DNA 
<213> Homo sapiens 

<400> 113 

gcttctcctg gcaatctctc tcagtttgaa gacattctct ttggtaacaa tgatatgtca 60 

gcttccattg gtgttgtggg tgttaaaatg tccgcagttg atggccagag acaggttgga 12 0 

gttgggtatg tggattccat acagaggaaa ctaggactgt gtgaattccc tgataatgat 180 

cagttctcca atcttgaggc tctcctcatc cagattggac caaaggaatg tgttttaccc 240 

ggaggagaga ctgctggaga catggggaaa ctgagacag 2 79 
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<210> 114 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 114 

gtaagcaaat tgagtctagt gat 23 

<210> 115 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 115 

agaggagatt ccaggcctag gaaag 2 5 

<210> lie 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 116 

tctttaattg acatgatact g 21 

<210> 117 
<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 117 

tttttgcttt tcttattcct tttc 24 

<210> 118 

<211> 33 

<212> DNA 

<213> Homo sapiens 

<400> 118 

tcatagtagt ttaaactatt tctttcaaaa tag 33 

<210> 119 

<211> 147 

<212> DNA 

<213> Homo sapiens 

<400> 119 

ataattcaaa gaggaggaat tctgatcaca gaaagaaaaa aagctgactt ttccacaaaa 60 
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gacatttatc aggacctcaa ccggttgttg 
gctgtattgc cagaaatgga gaatcag 

<210> 120 

<211> 85 

<212> DMA 

<213> Homo sapiens 

<400> 120 

gtacatggat tataaatgtg aattacaata 
ataatatgta aactatagtg acttt 

<210> 121 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 121 

ttagaaggat atttctgtca 

<210> 122 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 122 
actggcacca 

<210> 123 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 123 

gtggtataga aatcttcgat tttt 

<210> 124 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 124 
aaattcttaa ttttag 

<210> 125 

<211> 150 

<212> DNA 

<213> Homo sapiens 



aaaggcaaaa agggagagca gatgaatagt 12 0 

147 



tatataatgt aaatatgtaa tatataataa 60 

85 



20 



10 



24 



22 



<400> 125 

gttgcagttt catcactgtc tgcggtaatc aagtttttag aactcttatc agatgattcc 60 

aactttggac agtttgaact gactactttt gacttcagcc agtatatgaa attggatatt 12 0 

gcagcagtca gagcccttaa cctttttcag 150 

<2lO> 126 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 126 

gtaaaaaaaa aaaaaaaaaa aaa 2 3 

<210> 127 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 127 

aaaagggtta aaaatgttga tt 22 

<210> 128 

<211> 13 

<212> DNA 

<213> Homo sapiens 

<400> 128 

ttaaaaaatg ttt 13 

<210> 129 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 129 

cattgacata tactgaagaa get 23 

<210> 130 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 130 

tataaaggag ctaaaatatt tggaaat 2 7 

<210> 131 
<211> 43 
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<212> DNA 

<213> Homo sapiens 



^1 iir 



<400> 131 

attatacttg gattagataa ctagctttaa atgggtgtat ttt 43 

<210> 132 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 132 

ttcactaatg agcttgccat tc 22 

<210> 133 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 133 

tttctatttt attttttgtt tactag 26 

<210> 134 
<211> 134 
<212> DNA 
<213> Homo sapiens 

<400> 134 

ggttctgttg aagataccac tggctctcag tctctggctg ccttgctgaa taagtgtaaa 60 

acccctcaag gacaaagact tgttaaccag tggattaagc agcctctcat ggataagaac 120 

agaatagagg agag 134 

<210> 135 

<211> 44 

<212> DNA 

<213> Homo sapiens 

<400> 135 

gtatgttatt agtttatact ttcgttagtt ttatgtaacc tgca 44 

<210> 136 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 136 

gttacccaca tgattatacc 20 
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<210> 137 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 137 

cttacgtgct tagttgataa 20 

<210> 138 

<211> 53 

<212> DNA 

<213> Homo sapiens 

<400> 138 

attttaattt tatactaaaa tattttacat taattcaagt taatttattt cag 53 

<210> 139 
<211> 200 
<212> DNA 
<213> Homo sapiens 

<400> 139 

attgaattta gtggaagctt ttgtagaaga tgcagaattg aggcagactt tacaagaaga 6 0 

tttacttcgt cgattcccag atcttaaccg acttgccaag aagtttcaaa gacaagcagc 12 0 

aaacttacaa gattgttacc gactctatca gggtataaat caactaccta atgttataca 180 
ggctctggaa aaacatgaag 2 00 

<210> 140 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 140 

gtaacaagtg attttgtttt tttg 24 
<210> 141 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 141 

ttttccttca actcatacaa tata 24 

<210> 142 

<211> 26 

<212> DNA 

<213> Homo sapiens 
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<400> 142 

tttgtattct gtaaaatgag atcttt 
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<210> 143 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 143 

ttatttgttt gttttactac tttcttttag 30 

<210> 144 
<211> 110 
<212> DNA 
<213> Homo sapiens 

<400> 144 

gaaaacacca gaaattattg ttggcagttt ttgtgactcc tcttactgat cttcgttctg 60 
acttctccaa gtttcaggaa atgatagaaa caactttaga tatggatcag 110 

<210> 145 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 145 

gtatgcaata tactttttaa tttaag 26 

<210> 146 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 146 

cagtagttat ttttaaaaag caaag 2 5 

<210> 147 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 147 

ctttacccat tatttatagg att 23 

<210> 148 

<211> 23 

<212> DNA 

<213> Homo sapiens 
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<400> 148 

ttgtcacttt gttctgtttg cag 



23 



<210> 149 
<211> 124 
<212> DNA 
<213> Homo sapiens 

<400> 149 

gtggaaaacc atgaattcct tgtaaaacct tcatttgatc ctaatctcag tgaattaaga 60 
gaaataatga atgacttgga aaagaagatg cagtcaacat taataagtgc agccagagat 12 0 
cttg 124 

<210> 150 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 150 

gtaagaatgg gtcattggag 2 0 

<210> 151 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 151 

gttggaataa ttcttttgtc tat 23 

<210> 152 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 152 

tagtaggtat ttatggaata ctttt 25 

<210> 153 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 153 

tcttttcttc ttgtttatca ag 22 

<210> 154 
<211> 151 



27 



<212> DNA 

<213> Homo sapiens 

<400> 154 

gcttggaccc tggcaaacag attaaactgg attccagtgc acagtttgga tattactttc 
gtgtaacctg taaggaagaa aaagtccttc gtaacaataa aaactttagt actgtagata 
tccagaagaa tggtgttaaa tttaccaaca g 

<210> 155 

<211> 36 

<212> DNA 

<213> Homo sapiens 

<400> 155 

gtttgtaagt cattattata tttttaaccc tttatt 

<210> 156 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 156 

aattccctaa atgctctaac a 

<210> 157 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 157 

cattgcttct agtacacatt t 

<210> 158 

<211> 36 

<212> DNA 

<213> Homo sapiens 

<400> 158 

taatattttt aataaaactg ttatttcgat ttgcag 

<210> 159 

<211> 98 

<212> DNA 

<213> Homo sapiens 

<400> 159 

caaattgact tctttaaatg aagagtatac caaaaataaa acagaatatg aagaagccca 
ggatgccatt gttaaagaaa ttgtcaatat ttcttcag 
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<210> 160 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 160 

gtaaacttaa tagaactaa 

<210> 161 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 161 

taatgttctg aatgtcacct g 

<210> 162 
<211> 21 
<212> DNA 

<213> Homo sapiens 



<400> 162 

tcagtattcc tgtgtacatt t 21 

<210> 163 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 163 

tctgttttta tttttataca g 21 



<210> 164 

<211> 246 

<212> DNA 

<213> Homo sapiens 



<400> 164 

gctatgtaga accaatgcag acactcaatg 
gctttgctca cgtgtcaaat ggagcacctg 
aaggacaagg aagaattata ttaaaagcat 
aaattgcatt tattcctaat gacgtatact 
ttactg 



atgtgttagc tcagctagat gctgttgtca 60 
ttccatatgt acgaccagcc attttggaga 120 
ccaggcatgc ttgtgttgaa gttcaagatg 180 
ttgaaaaaga taaacagatg ttccacatca 240 

246 



<210> 165 
<211> 16 
<212> DNA 
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<213> Homo sapiens 



<400> 165 

gtaaaaaacc tggttt 16 

<210> 166 
<211> 19 
<212> DMA 

<213> Homo sapiens 
<400> 166 

ttgggctttg tgggggtaa 15 

<210> 167 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 167 

cgattaatca tcagtgtac 19 

<210> 168 

<211> 78 

<212> DNA 

<213> Homo sapiens 

<400> 168 

agtttaggac taacaatcca tttattagta gcagaaagaa gtttaaaatc ttgctttctg 60 
atataatttg ttttgtag 78 

<210> 169 

<211> 205 

<212> DNA 

<213> Homo sapiens 

<400> 169 

gccccaatat gggaggtaaa tcaacatata ttcgacaaac tggggtgata gtactcatgg 60 

cccaaattgg gtgttttgtg ccatgtgagt cagcagaagt gtccattgtg gactgcatct 12 0 

tagcccgagt aggggctggt gacagtcaat tgaaaggagt ctccacgttc atggctgaaa 18 0 

tgttggaaac tgcttctatc ctcag 205 

<210> 170 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 170 

gtaagtgcat ctcctagtcc ctt 23 
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<210> 171 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 171 

gaagatagaa atgtatgtct ctg 23 

<210> 172 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 172 

ccacatttta tgtgatggga a 21 

<210> 173 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 173 

atttcatgta attatgtgct tcag 24 

<210> 174 
<211> 248 
<212> DNA 

<213> Homo sapiens 
<400> 174 

gtctgcaacc aaagattcat taataatcat agatgaattg ggaagaggaa cttctaccta 60 
cgatggattt gggttagcat gggctatatc agaatacatt gcaacaaaga ttggtgcttt 12 0 
ttgcatgttt gcaacccatt ttcatgaact tactgccttg gccaatcaga taccaactgt 180 
taataatcta catgtcacag cactcaccac tgaagagacc ttaactatgc tttatcaggt 240 
gaagaaag 24 8 

<210> 175 

<211> 56 

<212> DNA 

<213> Homo sapiens 

<400> 175 

gtatgtacta ttggagtact ctaaattcag aacttggtaa tgggaaactt actacc 56 

<210> 176 
<211> 19 
<212> DNA 
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<213> Homo sapiens 



<400> 176 

cttctcatgc tgtcccctc 15 

<210> 177 

<211> 23 

<212> DMA 

<213> Homo sapiens 

<400> 177 

acgcttcccc aaatttctta tag 23 

<210> 178 

<211> 176 

<212> DMA 

<213> Homo sapiens 

<400> 178 

gtgtctgtga tcaaagtttt gggattcatg ttgcagagct tgctaatttc cctaagcatg 60 

taatagagtg tgctaaacag aaagccctgg aacttgagga gtttcagtat attggagaat 12 

cgcaaggata tgatatcatg gaaccagcag caaagaagtg ctatctggaa agagag 17 

<210> 179 

<211> 17 

<212> DMA 

<213> Homo sapiens 

<400> 179 

gtttgtcagt ttgtttt i7 

<210> 180 

<211> 21 

<212> DMA 

<213> Homo sapiens 

<400> 180 

catagtttaa cttagcttct c 21 

<210> 181 

<211> 21 

<212> DMA 

<213> Homo sapiens 

<400> 181 

attactcatg ggacattcac a 2 



<210> 182 



<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 182 
tgtgtttcag 

<210> 183 
<211> 171 
<212> DNA 
<213> Homo sapiens 

<400> 183 

caaggtgaaa aaattattca ggagttcctg tccaaggtga aacaaatgcc ctttactgaa 60 

atgtcagaag aaaacatcac aataaagtta aaacagctaa aagctgaagt aatagcaaag 12 0 

aataatagct ttgtaaatga aatcatttca cgaataaaag ttactacgtg a 171 

<210> 184 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 184 

atcccagtaa tggaatgaag 

<210> 185 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 185 

tgtaaaacga cggccagtct gaggtgattg gctgaag 37 

<210> 186 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 186 

ggaaacagct atgaccatgc cgttaagtcg tagccctt 3 8 

<210> 187 
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<211> 37 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 187 

tgtaaaacga cggccagtcg attaatcatc agtgtac 

<210> 188 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 188 

ggaaacagct atgaccatgc agagacatac atttctatct tc 

<210> 189 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 189 

tgtaaaacga cggccagtcg cattttcttc aaccagga 

<210> 190 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 190 

ggaaacagct atgaccatgc ctccccagca cgcgcc 

<210> 191 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 191 

ggccattgtc acagaggata aga 

<210> 192 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 192 

acacagccca cgaaggagtg 
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